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ABSTRACT 

Trans-splicing is a powerful approach to reprogram 
the genome. It can be used to replace 5', 3' or 
internal exons. The latter approach has been 
characterized by low efficiency, as the requirements 
to promote internal trans-splicing are largely 
uncharacterized. The trans-splicing process is 
induced by engineered 'RNA trans-splicing mol- 
ecules' (RTMs), which target a selected pre-mRNA 
to be reprogrammed via two complementary 
binding domains. To facilitate the development of 
more efficient RTMs for therapeutic applications 
we constructed a novel fluorescence based 
screening system. We incorporated exon 52 of the 
COL17A1 gene into a GFP-based cassette system 
as the target exon. This exon is mutated in many 
patients with the devastating skin blistering 
disease epidermolysis bullosa. In a double transfec- 
tion assay we were able to rapidly identify optimal 
binding domains targeted to sequences in the sur- 
rounding introns 51 and 52. The ability to replace 
exon 52 was then evaluated in a more endogenous 
context using a target containing COL17A1 exon 51- 
intron 51 -exon 52-intron 52-exon 53. Two selected 
RTMs produced significantly higher levels of GFP 
expression in up to 61% assayed cells. This novel 
approach allows for rapid identification of efficient 
RTMs for internal exon replacement. 

INTRODUCTION 

Conventional gene therapy strategies are designed to 
deliver the full-length cDNA copy of a therapeutic gene 
to produce a functional protein. However, these 
approaches have disadvantages that may reduce the 



chance of therapeutic success, including the difficulty to 
package and deliver large genes and the lack of endogen- 
ous regulation of the transgene expression in target cells. 
Moreover, autosomal dominant disorders may not be 
amenable to treatment by these conventional therapeutic 
approaches. These problems can be overcome by using a 
gene correction strategy, such as spHceosome mediated 
RNA ^ra^^-splicing. This pre-mRNA based method is 
characterized by the exploitation of naturally occurring 
splicing events during mRNA maturation (1,2). The en- 
dogenous splicing machinery is utilized to recombine two 
independent RNA molecules by trans-splicing, replacing 
the disease causing part of a gene by a wild-type version in 
an exon-wise manner. The ^ra/t^- splicing process is 
directed by engineered 'RNA ^ra^^- splicing molecules' 
(RTMs), which base-pair with a region of a target 
pre-mRNA via a complementary binding domain (BD). 
This interaction between the RTM and the target 
pre-mRNA confers specificity. Besides the BD, RTMs 
contain two other elements: a 5^ and/or a 3^ splice site 
that promote the ^ra^^- splicing reaction, and a wild-type 
coding region that can replace that region with any 
sequence ranging from a single exon to one or more 
genes, thus repairing the target gene or reprogramming 
it to encode any gene of interest. 

An RTM can be designed to replace 3^ 5^ or internal 
sequences of a target pre-mRNA to generate a new gene 
product (1). After binding to the target pre-mRNA, the 
^ra/7^-splicing process is initiated by the RTM, of which 
the spHcing and coding domains provide recognition se- 
quences for the spliceosome to facihtate what is essentially 
an alternative splicing reaction in trans. The efficiency of 
the ^ra^^- splicing process depends on the BD and the 
strength of the sphce site sequences of the RTM (3,4). 
There is a wide range of factors that influence the 
decision to splice (5). These factors remain largely unex- 
plored in the context of RNA ^ra^^-spHcing. 
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Most reports on RNA ^ra/t^- splicing for gene correction 
have employed y exon replacement to exchange down- 
stream coding sequences of a gene of interest, y trans- 
splicing therapy approaches have been successful in 
restoring potentially therapeutic levels of gene function 
in patient cells and in animal models of cystic fibrosis, 
X-linked immunodeficiency and hemophilia A (6-9). 

For correction of the upstream regions of a gene, 5^ 
^ra/7^-splicing was first reported using a double transfec- 
tion model to correct mutations expressed by a 
plasmid-encoded cystic fibrosis transmembrane receptor 
(CFTR) pre-mRNA. Those RTMs were designed to 
replace the 5^ portion of a transcript, inserting the 1.9 kb 
sequence coding for exon 1-10 of the CFTR gene (3). 
Similarly, 5^ trans-splicing has been used to replace 
(3-globin exon 1 in cells co-transfected with target and 
RTM expressing plasmids (10). Endogenous 5^ trans- 
splicing correction was first reported in 2008 by Wally 
et al., (11) who demonstrated that an RTM can correct 
over 1 kb of the upstream coding region of the PLEC 
gene. Restoration of normal plectin expression patterns 
were detected by immunofluorescence microscopy after 
RTM transfection of epidermolysis bullosa (EB) patient 
fibroblasts. Additionally, Wally et al. developed a 5^ trans- 
splicing strategy to specifically reprogram exons 1-7 of the 
keratin 14 gene (KRT14) in an autosomal dominant model 
of EB-simplex, leading to a phenotypic reversion of RTM 
treated cells in vitro (12). 

RNA ^ra^^-spHcing can also be used for internal exon 
replacement (lER) to correct internal coding regions of a 
gene by using the combination of 5^ and 3^ ^ra^^-spHcing 
(1,13,14). Here, an RTM with two BDs base-pairs up and 
downstream of the region to be replaced (i.e. a mutated 
exon), inducing the ^ra^^- splicing of a wild-type exon from 
the RTM to the naturally flanking exons of the endogen- 
ous target pre-mRNA. The main advantage of this 
method is the possibiHty to manufacture and deliver an 
RTM with a relatively short sequence into patient cells, 
thereby minimizing the size of the construct and many 
size-associated limitations. 

As a prototypic gene to perform lER, we analyzed the 
collagen 17A1 (C0L17A1) gene, which is mutated in the 
non-HerUtz junctional form of EB (15). EB is a genetically 
and clinically heterogeneous group of skin diseases 
characterized by blister formation and erosions of the 
skin and mucous membranes after minor trauma. 
To date, more than 10 different genes that encode for 
structural proteins within keratinocytes or mucocutaneous 
basement membranes have been linked to EB. The severity 
of the disease depends on which gene is mutated and the 
localization of the mutation within this gene (16,17). 

Here, we constructed a novel RTM screening system, in 
which the reporter molecule acGFP acts as marker for 
specific double trans-splicing events in the C0L17A1 
gene in vitro. The acGFP gene was split into three parts. 
The middle (internal) part was cloned into an RTM 
capable of performing two targeted ^r<2^^-splicing reac- 
tions. The 5^ and 3^ parts of GFP were cloned into the 
target molecule flanking the genomic target sequence of 
C0L17A1 intron 51-exon 52-intron 52. We show that this 
screening system can be used to rapidly identify those 



RTMs, which produce the highest levels of trans-splicing 
and function. 

MATERIAL AND METHODS 

Cell culture and transfection 

The human embryonic kidney cell line HEK293AD 
(Stratagene) was utilized for screening experiments. Cells 
were grown at 37° C and 5% CO2 in a humidified incuba- 
tor in DMEM supplemented with 10% FCS and lOOU/ml 
penicillin/streptomycin (Biochrom). Passaging of the cells 
was performed every 4-5 days using 1% Trypsin-EDTA 
(Biochrom), and cells were pelleted by centrifugation at 
llOg for 5min. Afterwards cells were seeded to a new 
tissue culture plate at the desired density. Cells were tran- 
siently transfected using jetPEI (Polyplus-transfection SA, 
Illkirch, France). Various amounts of plasmid DNA 
(1-10 |ig) were applied according to the manufacturer's 
protocol. 

Target molecule construction 

The engineered target molecule consists of a short 5^ 
region of acGFP sequence (183 bp), a C0L17A1 target 
region consisting of intron 51, exon 52, intron 52 and 
the 3^-coding region of acGFP (231 bp). The target was 
constructed by PCR ampHfication of the acGFP spHt 
parts (5^ acGFP forward primer: 5^-gatcggatcccaccatggt 
gagcaagggcgcc-3^ 5^acGFP reverse primer: 5^-tatgatatcac 
actcaccagggtgggccagggcac-3^ 3^acGFP forward primer: 
5^-gatcgcggccgcttttccctccaggtgaacttcaagatccgc-3^ 3^ 
acGFP reverse primer: 5^-gatctctagatcacttgtacagctcatcc-30 
and the target region intron 51-exon 52-intron 52 
C0L17A1 (intron 51 C0L17A1 forward primer: 5'-tatg 
atatcggtgccccgacggtgatgccccacc-3^ intron 52 C0L17A1 
reverse primer: 5^-gatcgcggccgcgtaaaacaccagagcttgggcaca 
gg-y) using Pfu Turbo DNA polymerase (Stratagene). 
The pacGFP vector (Clontech) and genomic DNA from 
a healthy donor was used as PCR template, respectively. 
The 5^ acGFP region was cloned into the expression vector 
pcDNA 3.1D/V5-HIS (Invitrogen), using the restriction 
enzymes BamHI and EcoRV. A 5^ spHce site (ag/ 
gtgagtgt) was included in the reverse amplification 
primer and cloned at the junction between the 5^ acGFP 
region and intron 51 of C0L17A1. The restriction 
enzymes EcoRV and NotI were used for cloning of the 
wild-type sequence of intron 51-exon 52-intron 52 of 
C0L17A1, followed by a y splice site (gcggccgcttttccctcc 
ag/g) and the y fragment of acGFP, which was cloned 
using NotI and Xbal sites. Plasmid preparations were 
carried out using the Plasmid Mini Prep Kit from 
Sigma-Aldrich (St. Louis, MO, USA), according to the 
manufacturer's protocol. All constructs and PCR 
products were sequenced using a 3130 ABI Prism auto- 
mated sequencer and ABI PRISM dye terminator cycle 
sequencing kits (Applied Biosystems, Foster City, CA, 
USA). PCR products were gel extracted before sequence 
analysis using the GFX™ PCR DNA and Gel Band 
Purification Kit (GE Healthcare). 
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COL17A1 mini-gene construction 

The C0L17A1 mini-gene target plasmid was constructed 
by cloning a PCR amplified product containing the 
C0L17A1 target region exon 51-intron 51-exon 
52-intron 52-exon 53 into the vector pcDNA 3. ID/ 
V5-HIS (Invitrogen). PCR was performed using an exon 

51 C0L17A1 specific forward primer (5^-gatcggatccc 
tgccggcttgtcattcatcccaggcc-30 and an exon 53 C0L17A1 
specific reverse primer (5^-gatctctagatttggaagaagtccatgagg 
tccgcagtcacgtt-30, Pfu Turbo DNA polymerase 
(Stratagene) and genomic DNA from an EB patient with 
a homozygous mutation (3898 del TC) in the C0L17A1 
gene. The resulting PCR product was cloned into the ex- 
pression vector using the restriction sites for BamHI and 
Xbal. 

Sequential construction to produce a double 
^rfl/i5-splicing RTM 

Selection of an initial 5' BD. A 5^ BD targeting the region 
of C0L17A1 exon 52/intron 52 was selected for the first 
step in construction of a double ^ra^^-spHcing RTM using 
a spHt GFP reporter model system according to the 
method of Wally et al. (12). In brief, the constructs were 
produced using the spHt reporter acGFP instead of 
dsRED. A 5^ BD Hbrary was constructed and ~40 
randomly picked clones were sequenced. Seventeen 
RTMs with BDs ranging from 78 to 314 nt that were com- 
plementary to essentially all regions of the exon 52-intron 

52 target were identified and tested for ^ra^^- splicing effi- 
ciency by quantifying GFP expression using flow 
cytometry. The most efficient BD was PCR ampHfied 
with primers containing flanking NotI sites. 

Initial double trans- splicing RTM construction. The double 
trans-^^\\Qmg RTM screening cassette vector was created 
in three steps. First, a pIRES2- AcGFP 1 vector (Clontech) 
was digested with EcoRI and KpnI. Elements of the y 
trans-?>^\\Qmg arm were introduced as double stranded 
synthesized oHgos into these sites, including an EcoRV 
site for later cloning in a BD Hbrary, a spacer and a 
branch point (underlined) (5^-gaattcgttaacgatatcgagaacatt 
attatagcgttgctcgag tactaac tggtacc-30. Second, the internal 
portion of acGFP (306 bp) was generated by PCR using 
Pfu Turbo DNA polymerase (Stratagene). 

The forward primer (5^-tatggtacctcttcttttttttctgcaggtga 
ccaccctgagctac-30 contains a Kpnl site and the remainders 
of the y splice site to generate a functional y RTM con- 
taining the same elements as previously described (18). 
The reverse primer (5^-tatgcggccgctgtaataatatcgcaacgagct 
ctcttaccttgatgccattcttggcc-30 contains a 5^ spHce site, a 
spacer region and a NotI restriction site. These two 
cloning steps removed the IRES and full-length GFP se- 
quences from the Clontech vector. Finally, the selected 
5^ BD was cloned into the NotI restriction site. The result- 
ing clones were sequenced to select one with the correct 
(complementary) orientation. 

Selection of a 3' binding domain. A y BD library was con- 
structed by fragmenting the PCR amplified target 



sequence of C0L17A1 intron 51 /exon 52 using CviJP 
(Roboklon; Berlin, Germany). 

Partial CviJP digestion of this 676 bp region was per- 
formed for 1-10 s at 37°C and the resulting fragments 
were pooled. CviJP cleaves and produces blunt end 
DNA fragments between the G and C at the sequence 
5^-PuGCPy-3^ The resulting fragments were cloned 
upstream of the y RTM spacer region into the EcoRV 
site. After cloning, 90 individual bacterial clones were 
checked for the presence of a BD fragment by colony 
PCR. DNA sequencing showed that ~95% of clones con- 
tained at least one BD, ~50% were complementary to the 
target region. Fifty-four RTMs harboring one or more 
complementary BDs were tested for ^ra^^-splicing effi- 
ciency quantifying GFP expression by flow cytometry. 
Geometric mean calculations were performed using the 
program FlowJo (Treestar). The most efficient BD target- 
ing intron 51 was chosen for further development. 

Selection of improved 5' BD. To improve the RTM 
selected above, an additional round of 5^ BD screening 
was performed. An EcoRV restriction site was introduced 
at the 5^ RTM spacer sequence downstream of the internal 
acGFP fragment by performing a second PCR ampHfica- 
tion of the acGFP fragment using an internal acGFP 
forward primer (5^-tatggtacctcttcttttttttctgcaggtgaccacc 
ctgagctac-30 and an internal acGFP reverse primer 
(5^-gatcgcggccgcgatatcttaataatatcgcaacgagctctcttaccttgatg 
ccattcttggcc-30. The resulting PCR product was subse- 
quently cloned into the RTM vector using the restriction 
enzymes Kpnl and NotI. The PCR ampHfied C0L17A1 
target region exon 52/intron 52 was CviJP digested as 
previously described and the resulting DNA fragments 
were cloned into the EcoRV site. Twenty-seven RTMs, 
harboring one or more complementary BDs for the 
target region were tested for ^ra^^-splicing efficiency by 
quantifying GFP expression using flow cytometry. Two 
low efficiency RTMs (RTM 1 and 3) and the two RTMs 
with the highest trans-^^Moing efficiency (RTM 2 and 4) 
were selected for further experiments. 

RNA isolation and cDNA synthesis 

RNA was isolated from 10^ cells using the RNeasy Mini 
Kit (Qiagen) according to the manufacturer's protocol. 
Purified RNA (1-2 |ig) was digested with DNase I 
(Sigma) for 30min at RT, which was subsequently 
inactivated by incubation for lOmin at 70°C, followed 
by cDNA synthesis using the iScript™cDNA Synthesis 
Kit (Biorad) with the provided mixture of oligo(dT) and 
random hexamer primers for cDNA synthesis. 

RT-PCR 

Exon replacement was detected by RT-PCR using GoTaq 
polymerase (Promega) and the following conditions: 95°C 
for 5min, 50 cycles of 30 s at 95°C, 30 s at 65°C, 30 s at 
72° C, and a final extension of 7min at 72° C, specific 
primers for acGFP (5^ acGFP specific forward primer: 
5^-gatcggatcccaccatggtgagcaagggcgcc-3^ and 3^acGFP 
specific reverse primer: 5^-cgccgatgggggtattctgctgg-30 and 
cDNA. Expected PCR products were the /r^^n^-splicing 
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product acGFP, the c/^-splicing product 5^acGFP-exon 
52-3^acGFP, and the fusion mRNA 5^acGFP-3^acGFP 
created by exon skipping on the target molecule. 
Co-transfection of single RTMs and the C0L17A1 
mini-gene target plasmid into HEK293AD cells 
produced two distinct splicing products, a ^ra^^-spliced 
product (exon 51-middle portion of acGFP-exon 53) 
and a c/^-spliced product (exon 51-exon 52-exon 53). 
These mRNAs were amplified by RT-PCR in one 
reaction using the C0L17A1 mini-gene specific primer 
pair (exon 51 C0L17A1 specific forward primer: 5^-ggag 
cgagctgatcagctacctcac-3^ and exon 53 C0L17A1 specific 
reverse primer: 5^-gccatcccttgcagtaggccctg-30 and the 
same conditions as described above. 

Flow cytometry 

Fluorescence microscopy and flow cytometric analysis was 
performed 2-4 days after RTM and target plasmids were 
co-transfected into HEK293AD cells. For cytometry, the 
transfected HEK293AD cells were washed with PBS 
buffer (Dulbecco's PBS), trypsinised to release the cells 
from the culture plates, and centrifuged for 5min at 
170^. The cell pellet was resuspended in 500-1000 |il 
PBS and applied to the Beckman Coulter FC500. 
Approximately 25 000-80 000 HEK293AD cells were 
analyzed for GFP reporter gene expression. The CXP 
software was used for data analysis. 

FACS sorting and western blot analysis 

Two days after HEK293AD cells were co-transfected with 
RTM and target expression plasmids, acGFP protein ex- 
pression was quantified by western blot analysis. Western 
blot analysis was performed using either total cell extracts 
from transfected HEK293AD cells or FACS selected GFP 
positive cells (Beckman Coulter Epics Altra). Cells were 
resuspended in lysis buffer (0.5 M Tris-HCl, pH 6.8, 20% 
Glycin, 10% SDS, 5% B-Mercaptoethanol) containing a 
protease inhibitor cocktail (Roche, Mannheim, Germany). 
After addition of 4x loading buffer and denaturation at 
95°C for 5min, SDS-gel-electrophoresis was performed at 
100-150 V for 1.5 h using a NuPAGE 10% or 4-12% Bis- 
Tris Gel (1.0 mm x 12 well) and a NuPAGE MOPS SDS 
running buffer (20 x) (Invitrogen). Post-electrophoresis, 
the SDS gel was equiUbrated in standard blotting buffer 
for up to 30 min at RT, and then the proteins were electro- 
blotted onto a nitrocellulose membrane (Amersham 
Hybon-ECL, GE Healthcare) for 75 min at 0.4 Ampere. 
After the protein transfer, the membrane was soaked in 
standard blocking buffer containing 1 % western blocking 
reagent (Roche) for 1 h at RT. First antibodies, anti-GFP 
rabbit IgG (Biomedica) and anti-Annexin I mouse mono- 
clonal IgG (Santa Cruz Biotechnology), were diluted 
1:1000 and 1:250 in TBS-T, respectively, containing 
0.5% western blocking reagent and incubated with the 
membrane over night at 4-8° C. The membrane was 
washed with TBS-T, then incubated with HRP labeled 
Envision^ anti-rabbit antibody (diluted 1:50-1:100, 
Dako) and HRP labeled Envision^ anti-mouse antibody 
(diluted 1:250-1:500, Dako) in TBS-T containing 0.5% 
Western blocking reagent. After washing with TBS-T the 



blot was visualized using the Immun-Star WesternC Kit 
(Biorad). Protein quantification was performed using the 
program Image Lab 3.0.1. (Biorad). 

RESULTS 

Rational design, construction and evaluation of 
RTM-BDs can be an arduous and time consuming 
process as the requirements for single or double trans- 
splicing are essentially unknown. To facihtate this 
process, a rapid method to identify the most efficient 
^ra^^- splicing RTMs is desirable. We have developed a 
screening system that can be used to select the most effi- 
cient double ^ra^^-splicing RTMs from a Hbrary contain- 
ing numerous RTMs with different BDs that are 
complementary to the target region. 

Selection of an initial 5^ BD 

To begin, we decided to identify a working 5^ BD targeting 
C0L17A1 exon 52 / intron 52 using the screening system 
previously published by Wally et al. (12). To minimize the 
complexity of screening and selecting for two BDs simul- 
taneously, we chose to proceed in a sequential manner, 
selecting one BD at a time. Seventeen RTMs specific for 
exon/intron 52 were individually tested for ^ra^^-spHcing 
induced restoration of GFP expression by flow cytometry 
(data not shown). The BD of the 5^ RTM that generated 
the highest level of GFP expression was incorporated into 
the 5^ trans-splicing arm of a double trans-splicing RTM 
vector (version 1). 

Screening for double ^ra/i5-splicing efficiency 

Since 3^ and 5^ ^ra^^-splicing have to work in synchrony, 
we feel that screening for the most efficient BDs for double 
trans-splicing should be done in the context of a double 
trans-splicing RTM. Our double ^ra^^-splicing fluores- 
cence based screening system utilizes two pre-mRNA com- 
ponents, a target molecule containing a disease associated 
region of the C0L17A1 gene and a Hbrary of RTMs that 
can potentially trans-spXicc and restore the coding 
sequence of a reporter gene flanking the target sequence 
(Figure 1). Tra^^-spHcing efficiency can be evaluated by 
restoration of reporter gene expression. For this screen, 
the reporter acGFP was divided into three portions and 
strategically cloned into the target molecule and the RTM. 
The 5^ and y parts of acGFP were cloned into the target 
vector flanking the C0L17A1 intron 51-exon 52-intron 52 
target region. The double ^ra/t^- splicing RTMs carry the 
middle portion of acGFP flanked by a y and a 5^ trans- 
spHcing arm, each consisting of a BD and a 5^ or y splice 
site. A diversity of RTMs, differing in their target specific 
BD sequences was created by fragmenting the target 
region of C0L17A1 and blunt end cloning of the resulting 
fragments into the RTM vector. 

Screening for an efficient 3^ BD 

A random y BD library targeting C0L17A1 intron 
51-exon 52 was constructed as described in the methods 
section and cloned into the double trans-spXicing 
RTM vector (version 1) harboring the initial working 
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Figure 1. Schematic description of the fluorescence based RTM screening system. (A) The fluorescence based RTM screen utihzes ceUs expressing 
both an exon specific target molecule and an RTM. Trans-s^Xicmg induced RNA recombination between the target and the RTM pre-mRNAs 
creates the fusion of all three acGFP fragments, thus restoring the full coding sequence and expression of functional acGFP. The target molecule 
contains exon 52 and the flanking introns of C0L17A1, along with 5' and 3' fragments of acGFP. The RTM contains the internal coding region of 
acGFP (the missing coding sequence) flanked by a 3' and 5' sphce site and other features required for the doublQ- trans-splicing reaction. Each RTM 
has two arms, containing an acceptor (3^ or donor (5^ splice site, a short spacer region and a 'random' BD created by fragmenting the C0L17A1 
target region. The randomized fragments were blunt ended, and cloned into the RTM backbone using the restriction site for EcoRV. (B) AcGFP 
expression was visualized by fluorescence microscopy in HEK293AD cells co-transfected with RTM and target molecule plasmids (40 x magnifica- 
tion). BP: Branch Point, pPT: Polypyrimidine Tract, SS: Splice Site, nt: nucleotides. 



5^ BD (Figure 2A). From a library of thousands of inde- 
pendent RTMs, 90 unique RTMs were sequenced and 54 
with complementary BDs were identified. Each of these 
RTMs was individually tested by co-transfection with 
target plasmid into HEK293AD cells and analyzed for 
trans-^^\\cmg efficiency by GFP expression using flow 
cytometry. After calculating the average GFP expression 
of RTM treated cells, the y BD of the most efficient RTM 
was chosen for further RTM improvement. 

Screening for an efficient 5^ BD 

The main aim of this additional screen was the further 
improvement of the existing RTM by screening for 
a new 5^ BD that works well together with the 
selected y BD specific for intron 51 and better than the 
initial 5^ BD specific for exon/intron 52 (Figure 2B). 
Twenty-seven different RTMs, with the selected y BD 
(Figure 3) and a variable BD for 5^ ^ra^^-splicing, were 
evaluated for efficient ^ra^^-splicing efficiency by flow 
cytometry. The geometric mean of GFP expression in 



RTM treated cells ranged from 2.6 to 3.5. Geometric 
mean calculation was performed only for RTMs that 
induced GFP expression in >5% of all analyzed cells. 
Five RTMs (RTM 0, 1, 2, 3, 4) from the 5^ double 
^r<3^^- splicing RTM library were selected as candidates 
for further experiments shown in this study (Figure 3). 
RTM 0 lacks a BD for the 5^ splicing reaction and 
was therefore included as negative control in the experi- 
ment. RTMs 1 and 3 were representatives for less func- 
tional RTMs, as the level of acGFP expression was 
decreased in comparison to the most efficient RTMs 2 
and 4. 



Exon replacement on mRNA level 

A C0L17A1 specific double ^ra^^-spHcing RTM is able to 
recombine with a target pre-mRNA by two synchronous 
splicing reactions (5^ and y). Thus the exon on the 
target molecule should be replaced by the middle portion 
of acGFP, restoring the open reading frame of acGFP 



el08 Nucleic Acids Research, 2011, Vol. 39, No. 16 



Page 6 of 1 1 



A target molecule & RTM version 1 



Intron 51 



Intron 52 



[ 5' acGFP 


5'SS 


3'SS 


Exon 52 


5'SS 


3'SS 


3' acGFP 








490nt 720nt 



3'SS 


int. acGFP 


5'SS II nil 


BP pPT 






184nt| 489nt COnStant 5 BD 


5* acGFP 


Int. acGFP 


3* acGFP ^ 


1nt acGFP fusion mRNA 720nt 



B target molecule & RTM version 2 



Intron 51 



Intron 52 



1 5' acGFP 


5'SS 


3'SS 


Exon 52 


5'SS 


3'SSP 







3* acGFP 



1nt I83nt 



490nt 720nt 



variable 5' BD 



1 3'SS 


^ 1 

int. acGFP 


5'SS 


r 






5' acGFP 



int. acGFP 



3' acGFP 



Int 



acGFP fusion mRNA 



720nt 



Figure 2. Sequential construction of a double /ra/i5-splicing RTM. Since y and 5' /ra/i5-splicing have to work in synchrony, we beheve that 
screening for the most efficient BDs for double /ra/i^-spHcing should be done in the context of a double /ra^^-splicing RTM. A random 3' BD 
library targeting C0L17A1 intron 51-exon 52 was constructed and cloned into the double trans-s^Mcing RTM vector (version 1) harboring the initial 
working 5' BD identified using the method of Wally et al. (A). To further improve the efficiency of the double /ra/i^-spHcing RTM (version 2), a new 
5' BD screen was performed using the efficient y BD identified in the 3' BD screen (A) and a variable 5' BD specific for the C0L17A1 exon/intron 
52 target region (B). 



leading to the expression of the reporter molecule in RTM 
treated cells. 

One day after transfecting double trans-^^\\cmg RTMs 
into HEK293AD cells, different spHcing patterns were 
observed by RT-PCR. C/^-splicing within the target 
molecule generated an mRNA consisting of the 5^ and y 
acGFP with exon 52 of C0L17A1 in the middle. This 
demonstrates that the acceptor (30 and donor (5^ splice 
sites at the exon / intron junctions were recognized by the 
endogenous splicing machinery and that the target introns 
were excised. The c/^-spHced 5^ acGFP-C(9L77^7 exon 
52-3^ acGFP fusion product was detected by RT-PCR 
ampHfication and agarose gel electrophoresis. This band 



was visible in all HEK293AD cells transfected with target 
plasmid (Figure 4, lanes 1, 4 and 5). Double ^ra^^-spHcing 
between the RTM and target molecule pre-mRNAs lead 
to the replacement of exon 52 of the target molecule by the 
internal acGFP sequence carried by the RTM. The result- 
ing full-length GFP mRNA was exclusively detected in 
HEK293AD cells co-transfected with both target and 
RTM expression plasmids (Figure 4, lanes 4 and 5). 
Since the products created by cis- or trans-splicing differ 
in size, discrimination by RT-PCR is possible, as shown in 
Figure 4, lanes 4 and 5, indicated bands. Skipping of exon 
52 was also detected in cells transfected with the target 
plasmid (Figure 4, lanes 1, 4 and 5). PCR products were 
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excised from the gel and splicing products were confirmed 
by DNA sequence analysis. The PCR primers used were 
designed to hybridize to the 5^ and y terminal regions of 
the target molecule. No ^ra^^-spHced products were 
detected in HEK293AD cells transfected exclusively with 
RTM (Figure 4, lanes 2 and 3). 



Intron 51 , , Intron 52 



RTMO a^^™ 
RTMl a^^H 

RTM 2 a^^H ^T^" 
RTM 3 a^^H ^ e^^H 

RTM 4 a^^H 

Figure 3. 3' and 5' BDs of selected RTMs. The BD sequences have a 
great influence on the efficiency of /ra/i5-spHcing. Five RTMs selected 
from an RTM hbrary with the same 3' BD for intron 51 (a; binding 
position from nt 106 of intron 51 to nt 257 of intron 51) of C0L17A1 
but with different BDs for the exon 52 — intron 52 target region were 
selected for further testing. RTM 0 had no 5'-BD, whereas RTM 2 and 
4 harbored two separate BDs specific for exon 52 — intron 52. One 
region of complementarity of RTM 2 masks the exon/intron 
boundary of exon 52 — intron 52 (c; binding position from nt 360 of 
exon 52 to nt 190 of intron 52). The second domain repeats some of 
same region of the 5'-end of intron 52 (d; binding position from nt 20 
to nt 63 of intron 52). RTM 4 binds to the central (f; binding position 
from nt 318 to nt 338 of intron 52) and 3' (g; binding position from nt 
457 to nt 555 of intron 52) portion of intron 52. RTMs 1 and 3 each 
have one 5' BD complementary to the central portion of intron 52 (b; 
nt 217 to nt 297 of intron 52, e; nt 191 to nt 297 of intron 52). 



Double /ra/7^-splicing into the C0L17A1 gene was 
modeled using a mini-gene target plasmid that was con- 
structed to express the endogenous pre-mRNA region 
from exon 51 through exon 53. Successful double trans- 
splicing should cause the replacement of C0L17A1 exon 
52 with the internal portion of acGFP (Figure 5A). 
HEK293AD cells were co-transfected with plasmids ex- 
pressing the C0L17A1 mini-gene target and a single 
RTM (1-4). Tra^^- splicing efficiency was evaluated by 
RT-PCR using RNA extracted from these cells one day 
post transfection. Tra/t^- splicing between the target exons 
and the middle portion of acGFP demonstrates the effi- 
ciency of the RTMs tested in the context of the endogen- 
ous C0L17A1 sequence (Figure 5B). Both splice products 
(cis and trans) were amplified by RT-PCR and visualized 
by agarose gel electrophoresis. These PCR products were 
gel purified and confirmed by DNA sequence analysis. 

Flow cytometric analysis of treated HEK293AD cells 

To detect acGFP repair and expression at the protein 
level, fluorescence microscopy and flow cytometry were 
performed on HEK293AD cells co-transfected with 
RTM and target plasmids. 

Two or more days post co-transfection with RTM and 
target molecule the cells were evaluated for acGFP expres- 
sion by fluorescence microscopy. The fraction of acGFP 
expressing cells and the expression level of GFP reporter 
was also quantified by flow cytometry. The sequences 
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Figure 4. Detection of acGFP restoration by double RNA /ra/t5-splicing on mRNA level. Trans-splicing on mRNA level can be detected by 
RT-PCR in HEK293AD cells co-transfected with RTM 4 or RTM 2 and target molecule expressing plasmids. C0L17A1 exon 52 replacement 
by the internal acGFP fragment contributed by the RTM was detected using a 5' acGFP and y acGFP specific primer pair to discriminate between 
cis- and trans-splicing events. The cDNA templates included: lane 1: target molecule alone, lane 2: RTM 2 plasmid only, lane 3: RTM 4 plasmid 
only, lane 4: target molecule + RTM 2 plasmids, lane 5: target molecule + RTM 4 plasmids, lane 6: water control. The c/5-splicing product 5'acGFP - 
exon 52 - 3'acGFP (lanes 1, 4, 5), the trans-splicing product acGFP (lanes 4 and 5; The marked band in lane 4, indicated by an arrow, was sequenced 
after recovery and gel extraction) or the exon skipping product 5' acGFP - 3' acGFP (lanes 1, 4, 5) was visualized on a 2% agarose gel after gel 
electrophoresis. Lane 7: DNA ladder mix (Fermentas). 



el08 Nucleic Acids Research, 2011, Vol. 39, No. 16 



Page 8 of 1 1 



COL17A1 mini-gene 



Intron 51 



Exon 51 


5'SS 










pi 









Exon 52 



Intron 52 
5'SS 3'SSl 



RTM 




3' frans-splicing 



5' frans-splicing 



COL17A1- acGFP fusion mRNA 



B 



mini-gene RTI\/I4 



RTIVI2 RTIVI4 RTIVI1 RTIVI3 
mini-gene mini-gene mini-gene mini-gene 




^Exon 51 -Exon 52-Exon 53 
^Exon 51 -int. acGFP-Exon 53 



Figure 5. Double RNA /ra/i5-splicing between RTM and C0L17A1 mini-gene. (A) A C0L17A1 mini-gene expression plasmid was constructed by 
cloning the endogenous gene region exon 51-intron 51-exon 52-intron 52-exon 53 of C0L17A1 into a pcDNA 3.1D/V5-HIS expression vector 
(Invitrogen). Co-transfection of the C0L17A1 mini-gene expression plasmid and a functional RTM generated RNA /ra?25-splicing between the 
middle portion of acGFP carried by the RTM to exon 51 and 53 of the target molecule. (B) One day after HEK293AD cells were co-transfected with 
either RTM 1 (lane 6), 2 (lane 4), 3 (lane 7), or 4 (lane 5) and C0L17A1 target mini-gene expression plasmids, double /ra^5-splicing was detected by 
RT-PCR. Using a C0L17A1 mini-gene specific primer pair (exon 51 C0L17A1 specific forward primer, exon 53 C0L17A1 specific reverse primer) it 
is possible to discriminate between the product of cw-spUcing (exon 51-exon 52-exon 53 = 442 bp) or /raw^-spHcing (exon 51 — middle acGFP 
fragment — exon 53 = 358 bp). No /ra^5-sphcing products were detected by RT-PCR of RNA from HEK293AD cells transfected with C0L17A1 
mini-gene expression plasmid (lane 2) or RTM 4 (lane 3) alone. Lane 1: DNA ladder mix (Fermentas). 



included as BDs have a profound effect on the efficiency 
of double ^ra^^-splicing RTMs as quantified by the 
number of GFP positive cells and the level of GFP 
produced. Compared with the less efficient RTMs 1 
(4.2% GFP positive cells) and 3 (12.8%), RTMs 2 and 4 
produced significantly higher levels of acGFP expression 
in up to 61% of transfected HEK293AD cells (Figure 6G). 
Background expression levels were less than 1% as 
detected in cells co-transfected with target and RTM ex- 
pression plasmid 0, which had no 5^ BD (Figure 6D). No 
acGFP expression was detected after single transfection 
with target molecule, RTM 2, or RTM 4 (Figure 6A-C). 
The geometric mean of RTM 1 and 3 was 2.17 and 2.53. 
The most efficient RTMs 2 and 4 had significantly higher 
values (geometric mean: 6.77 and 5.28) for the average 
GFP expression. 

Detection of acGFP by western blot analysis 

Western Blot analysis was performed to demonstrate that 
the portion of acGFP carried by the RTM /r^^w^- spliced 



twice into target pre-mRNA in treated cells and correctly 
assembled all three of the spHt parts of acGFP. The 
repaired acGFP should be detected at the same position 
as a full-length acGFP after immunostaining. 

Western blot analysis was performed at least two days 
after transfection using total cell extracts from transfected 
HEK293AD cells to quantify the restoration of full-length 
acGFP (27kDa) by ^ra^^-splicing (Figure 7). A full-length 
acGFP-expressing plasmid was used as positive control in 
the experiment (Figure 7, lane 2). AcGFP was visualized 
after blotting and immuno-staining of HEK293AD total 
protein extracts from cells co-transfected with target 
molecule along with either RTM 2, 3, or 4 (Figure 7, 
lanes 4, 6 and 7). RTMs 0 and 1 were not able to 
generate a detectable acGFP band (Figure 7, lanes 5, 8). 
These RTMs also produced less than ~5% GFP positive 
cells as quantified by flow cytometric analysis. No acGFP 
expression was detected after single transfection of target 
molecule, RTM 4 or RTM 2 alone into HEK293AD ceUs 
(Figure 7, lanes 9-11). HEK293AD cells co-transfected 
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acGFP acGFP 

Figure 6. Detection of acGFP expression by flow cytometry. Flow cytometric analysis of HEK293AD cells co-transfected with one RTM and target 
expression plasmids demonstrates the abihty to discriminate between weak and highly efficient RTMs. Single transfection of HEK293AD cells with 
target molecule plasmid (A), RTM 2 (B) or RTM 4 (C) did not generate any expression of acGFP. Transfection of target molecule in combination 
with RTM 0, which does not have a specific BD for intron 52, generates weak acGFP expression in ~0.7% of analyzed cells (D). The RTMs 1 (E) 
and 3 (F) generated acGFP expression in 4.2 and 12.8% of co-transfected cells. RTM 2 (G) and 4 (H) demonstrated the greatest /ra/i5-spHcing 
efficiency, up to 61% of co-transfected HEK293AD cells were green, and the geometric mean level of acGFP expression was significant greater. All 
RTMs harbor the same 3' BD specific for intron 51, but differ in the binding sequences complementary to intron 52 of C0L17A1 (Figure 3). 



with RTM 2 and target expression plasmids were also 
selected by FACS sorting with a purity of ~90% (data 
not shown). Removal of the untransfected cells increased 
the intensity of the acGFP band significantly (Figure 7, 
lane 3). 

DISCUSSION 

In this study we demonstrate that lER is a technology that 
has potential appHcations in treating genetic diseases. 



However, at present there are no rules or guidelines for 
designing highly efficient double trans-^^\\cmg RTMs. 
Building RTMs one at a time and testing them for effi- 
ciency is an arbitrary and arduous process. Thus we have 
developed a rapid screening system to screen for efficient 
BDs to two introns flanking a given exon. As a model 
system we chose to replace exon 52 of the C0L17A1 
gene, a gene mutated in the devastating disease of the 
skin and mucous membranes EE. 

The methodology of RNA ^ra/t^- splicing provides 
many important advantages over conventional gene 
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Figure 7. Detection of full-length acGFP restoration by western blot analysis. After co-transfection of target molecule and either RTM 2 (lane 6), 
4 (lane 7), or 3 (lane 4), acGFP protein (27kDa) was detected by western blot analysis. As positive control, total-cell extract from acGFP control 
vector (acGFP cloned in pcDNA 3.1D/V5-HIS vector, Invitrogen) transfected HEK293AD cells was also included (lane 2). Lane 3 demonstrates a 
significant increase (~3-fold) in the amount of acGFP protein in acGFP positive cells collected by FAGS sorting of HEK293AD cells co-transfected 
with RTM 2 and target expression plasmids (lane 3) compared to total cell extract from co-transfected HEK293AD cells without sorting (lane 6). No 
detectable acGFP expression was visible after single transfection of target molecule (lane 11), RTM 4 (lane 10) or RTM 2 (lane 9) into HEK293AD 
cells. No expression of acGFP was detected by western blot analysis of HEK293AD cells co-transfected with target molecule and either RTM 1 
(lane 5) or RTM 0 (which is missing a specific BD for intron 52, lane 8). Annexin I (37 kDa) was included as the loading control for this experiment. 
Lane 1: Precision Plus Protein WesternG Standards (Biorad). 



therapy approaches. Mutated RNA molecules can be 
reprogrammed to wild-type, thus reducing the amount 
of c/^-spliced disease associated gene expression. RNA 
^ra^^-splicing can be used to replace 5^ y or internal 
exon(s), referred to as 5^ y or lER. To date, several 
groups have shown functional RNA repair using either 
y or 5^ RNA ^ra^^-splicing in genetic diseases (6-12). 

Still, there are many distinct genetic conditions in which 
combining y and 5^ ^ra/i^- splicing might be advantageous 
or even the only feasible therapeutic strategy. Such situ- 
ations occur for example when the gene segment to be 
replaced by y or 5^ RNA ^ra^^-spHcing is too large to be 
delivered by existing vectors. lER permits the use of 
RTMs, which are only ~200nt larger than the exon(s) 
to be replaced. Double ^ra/t^- splicing RTMs do not need 
to deliver the full-length coding sequence of a gene, un- 
translated regions (UTRs) and start or termination 
codons. Alternatively spHced exons and regulatory se- 
quences within the 5^- and 3^-UTRs, which are essential 
for mRNA stability and translation regulation, are left 
intact in the target pre-mRNA. One such gene is PLEC, 
which has multiple mRNA isoforms some that exceed 
14 kb. Mutations in this gene lead to a subgroup of EB 
that is associated with progressive form of late onset 
muscular dystrophy (EBS-MD). In this case, 3^ or 5^ 
RNA trans-splicing repair can only be applied to a 
fraction of EB patients that carry mutations in exons 
located at the 5^- and 3^-end of the gene. The terminal 
exon of PLEC, exon 32, is >6.5kb by itself. Therefore, 
double ^ra/7^-splicing to replace the 3.38 kb coding 
sequence of exon 31, is required to effect delivery and 
repair of mutations in this major EBS-MD disease 
associated region. Additionally, the expression of PLEC 
isoforms, should not be altered, except for the correction 
of mutations within the targeted exon. 

Other promising target genes for lER are the EB 
associated genes C0L17A1 and C0L7A1. Both of these 
genes have been corrected using 3^ trans-splicing (6,18). 
In vitro studies by Murauer et al. demonstrated a pheno- 
typical correction of C0L7A1 in dystrophic EB (DEB) 



patient cells. Primary keratinocytes from a DEB patient 
were transduced with a retrovirus encoding a 3^ RTM 
designed to repair C0L7A1 exons 65-118. Significant 
levels of normal type VII collagen expression was 
restored. The treated keratinocytes secreted type VII 
collagen at the basement membrane zone in skin equiva- 
lents. A double RNA ^ra^^- splicing strategy may be 
needed to repair many of the internal exons (exons 
upstream of exon 65) of C0L7A1 as the 3^ trans-splicing 
strategy described above would require an RTM that 
exceeds the packaging limit of many delivery systems. 

A double ^ra^^-spHcing RTM is a tool that can replace 
one or more internal exons of a targeted pre-mRNA. The 
exons to be reprogrammed by the RTM are chosen on the 
basis of their cHnical relevance. The design of the other 
RTM elements is much more empiric. Alternative se- 
quences for the spacers, 3^ and 5^ splice sites can 
modulate ^r<2^^- splicing efficiency or specificity; however, 
the sequence of the BDs and their interactions with the 
pre-mRNA target clearly has a profound effect (1-3). At 
present the rules for building optimal double ^r<2^^- splicing 
RTMs are unknown. Therefore we created a fluorescence 
based screening system to rapidly identify the most effi- 
cient double trans-splicing RTMs from a library of RTMs 
with target specific, but random position and length of 
BDs. 

In the fluorescence based screening system, the repair of 
the reporter molecule acGFP acts as marker for specific 
double ^ra^^-spHcing events in co-transfected cells. Using 
flow cytometry, we were able to rapidly evaluate and 
compare the ^r<2^^- splicing efficiency of RTMs differing 
in their target binding sequences. Up to 60% of 
analyzed HEK293AD cells expressed high levels of 
acGFP after co-transfection with target and either 
double ^ra^^-splicing RTM 2 or RTM 4. Under the 
same co-transfection conditions, other RTMs with an 
identical 3^ BD but no or a different 5^ BD generated 
0.7-13% green cells. There are several possible explan- 
ations for the observed difference in efficiency: (i) It is 
likely that binding to the target molecule requires that 
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the target region is accessible, that it does not strongly 
interact with itself or other molecules in the nucleus; 
(ii) two discontinuous BDs may be more stable or more 
difficult to completely disrupt than a single BD; (iii) block- 
age of the exon/intron boundary of the target molecule, 
accomplished by the longer BD of RTM 2, may interfere 
with recognition of the exon 52 5^ splice site and facihtate 
^ra^^- splicing; and (iv) BDs with higher GC content may 
be more stable in binding to the target region. The BDs of 
RTM 2 (BD c = 61% G+C, BD d = 66%) and RTM 4 
(BD f = 57%, BD g = 56%) have a higher GC content in 
comparison to those of RTM 1 (47% G+C) and RTM 3 
(52%). The essential point is that the BDs for both trans- 
splicing reactions (3^ and 5^ must work together well for 
efficient trans-^^Mcmg replacement of the target exon. 
Immunostaining was performed to confirm the production 
of full-length 27-kDa acGFP protein. Very low-level ex- 
pression of acGFP was detected by flow cytometric 
analysis of HEK293AD cells co-transfected with target 
molecule and RTM 0, which has no specific 5^ BD. This 
may have been caused by the RTM performing a specific 
y ^ra^^- splicing reaction into the target and a relatively 
non-specific 5^ ^ra^^-spHcing reaction into the target, 
facilitated by the interaction of the single BD of the 
RTM with the target. We also created a ^ra^^-spHcing 
model system that more closely resembles the endogenous 
gene. A C0L17A1 target mini-gene was created with fully 
endogenous sequence from exon 51 through exon 53, with 
the intervening exon/intron boundaries. This model allows 
for further characterization and selection of double trans- 
splicing RTMs in a standard cell line (e.g. HEK293AD 
cells) prior to final selection in EB patient cells. Since 
there are no general rules for predicting optimal binding 
regions of a pre-mRNA of interest, the described RTM 
screen represents a promising tool to optimize gene cor- 
rection by spHceosome mediated RNA ^ra^^-spHcing. 
Different RTM-BDs and spacer region sequences can be 
rapidly evaluated by this screen. This should reduce time 
and effort required to select optimal RTMs prior to testing 
them in the more complex, but highly relevant endogenous 
gene expression model systems. In this report we have 
demonstrated that 5^ and 3^ RNA ^ra/t^- splicing can be 
combined as a promising tool to repair large genes or 
those that occur in multiple isoforms. Our fluorescence 
based RTM screening system should help to more 
rapidly optimize RNA correction by ^r<2^^- splicing to 
higher levels, which should have a greater probabiHty to 
correct genetic diseases. 
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